Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria

نویسندگان

  • Takanobu Nishiura
  • Yoshiki Hirano
  • Yuki Denda
  • Masato Nakayama
چکیده

Reverberation-robust speech recognition has become very important in the recognition of distant-talking speech. However, as no common reverberation criteria for the recognition of reverberantspeech have been proposed, it has been difficult to estimate this. We have thus focused on a reverberation criterion for the recognition of distant-talking speech. The reverberation time is generally currently used as a reverberation criterion for the recognition of distant-talking speech. This is unique and does not depend on the position of the source in a room. However, distant-talking speech recognition greatly depends on the location of the talker relative to that of the microphone and the distance between them. We investigated a suitable reverberation criterion with the ISO3382 acoustic parameters for distant-talking speech recognition to overcome this problem. We first calculated distant-talking speech recognition with early and late reflections based on the impulse response between the talker and microphone. As a result, we found that early reflections within about 12.5 ms from the duration of direct sound contributed slightly to distant-talking speech recognition in non-noisy environments. We then evaluated it based on ISO3382 acoustic parameters. We consequently confirmed that the ISO3382 acoustic parameters are strong candidates for the new reverberation criteria for distant-talking speech recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance estimation of reverberant speech recognition based on reverberant criteria RSR-dn with acoustic parameters

Reverberation-robust speech recognition has become very important in the field of distant-talking speech recognition. However, as no common reverberation criteria for the recognition of reverberant speech have yet been proposed, it has been difficult to estimate its effectiveness. To address this problem in 2007, we investigated early and late reflections on distanttalking speech recognition to...

متن کامل

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

We propose a blind dereverberation method based on spectral subtraction using a multi-channel least mean squares (MCLMS) algorithm for distant-talking speech recognition. In a distant-talking environment, the channel impulse response is longer than the short-term spectral analysis window. By treating the late reverberation as additive noise, a noise reduction technique based on spectral subtrac...

متن کامل

Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification

Deep neural network (DNN)-based approaches have been shown to be effective in many automatic speech recognition systems. However, few works have focused on DNNs for distant-talking speaker recognition. In this study, a bottleneck feature derived from a DNN and a cepstral domain denoising autoencoder (DAE)-based dereverberation are presented for distant-talking speaker identification, and a comb...

متن کامل

Temporal-spatial processing of a single speech reflection in normal-hearing and hearing-impaired listeners

Reflections and reverberation considerably affect speech recognition in rooms. Early reflections of the speech signal have mostly been found to enhance speech intelligibility [1, 2, 3, 4], while late reflections can have detrimental effects [4, 5]. A better understanding of the mechanisms underlying speech perception in noisy and reverberant conditions can be important for various practical inv...

متن کامل

A Combined Approach for Estimating a Feature-domain Reverberation Model in Non-diffuse Environments

A combined approach for estimating a feature-domain reverberation model suitable for the robust distant-talking automatic speech recognition concept REMOS (REverberation MOdeling for Speech recognition) [1] is proposed. Based on a few calibration utterances recorded in the target environment, the combined approach employs ML estimation and blind estimation of the reverberation time to determine...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007